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1.  Introduction 


Body  sensors  show  promise  in  healtheare  and  the  greater  biomedical  sector.  Mobility  and 
portability  are  desirable  for  wearable  sensors  to  increase  personal  care  in  diagnosis,  prevention, 
and  response.  A  key  challenge  to  this  mobility  lies  in  the  lack  of  “real-world  neuroimaging”  tools 
that  can  accurately  and  appropriately  collect  low  signal-to-noise  ratio  (SNR)  data  outside  of 
controlled  conditions  such  as  specialized  laboratories  or  clinical  environments.*  The 
development  of  such  tools  will  enable  optimization  of  brain-computer  interactive  technologies 
(BCIT)  in  a  fieldable  form  factor.^ 

The  major  barrier  for  achieving  such  a  device  that  can  handle  low  SNR  data  and  wirelessly 
transmit  the  information  is  power  consumption.  Current  batteries  add  substantial  bulk  and 
weight,  restrict  application  space,  prevent  implementation  of  implantable  devices  to  the  system, 
and  also  introduce  maintenance  issues  and  lifetime  costs.  Energy  harvesting  from  ambient 
energy  sources  presents  a  potentially  indefinite  solution,  but  requires  system  design  that 
consumes  less  power  than  is  harvested. 

Recent  advances  in  ultra-low  power  chip  design  techniques  have  enabled  a  push  in  the 
development  of  small  devices  capable  of  complex  computations  but  with  long  lifetime.  These 
dramatically  extended  lifetimes  are  feasible  without  the  need  for  attached  batteries  or  are 
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sustainable  using  energy  harvesting  techniques.  For  example,  Zhang  et  al.  present  a 
reconfigurable  system-on-chip  (SoC)  that  is  capable  of  processing  data,  including  wireless 
transmissions,  in  electrocardiography  (ECG),  electromyography  (EMG),  and 
electroencephalography  (EEG)  applications  that  operate  using  thermoelectrically  generated 
energy. 

While  suitable  for  measuring  ECG  signals,  the  approaches  used  to  date  are  not  sufficient  for  use 
in  EEG  measurements.  Brain-source  signals  are  significantly  smaller  (1-10  pV  as  compared  to 
millivolts)  and  have  an  extremely  low  SNR.  EEG  data  acquisition  systems  will  require  high 
sensitivity,  strong  amplification  of  the  target  signal,  and  high  bit-depth  resolution  of  the  analog- 
to-digital  converter  (ADC)  in  order  to  properly  resolve  the  small  signal  from  the  large  voltage 
fluctuations.  In  order  to  increase  system/signal  fidelity,  the  analog  front-end  (AFE)  was 
redesigned  as  seen  in  Fig.  1,  resulting  in  a  12-bit  successive  approximation  register  (SAR)/24-bit 
sigma-delta  modulator  option  with  digital  hardware  deciding  the  duty  cycling,  or  on/off,  of  the 
devices  based  on  power  requirements. 
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Fig.  1  Targeted  systems  for  redesign  in  the  analog  front  end 

The  challenge  here  is  to  design  a  low  power  digital  library  that  is  robust  across  process  comers 
while  balancing  area  consumption  with  the  speed  and  frequency  of  operation  for  the  logic  gates. 
We  explore  body  biasing  as  an  option  to  increase  the  drive  strength.  We  also  investigate  the  use 
of  transmission  gate  logic  to  improve  the  area  consumption  and  robustness  across  process 
comers  as  compared  to  static  complementary  metal-oxide  semiconductor  (CMOS) 
implementations.  We  validate  such  techniques  through  the  design  and  simulation  of  inverters, 
full  adders,  and  a  five-stage  cascaded  integrator-comb  (CIC)  filter  (inverter,  XOR,  NAND,  flip 
flop,  full  adder,  ripple  carry  adder,  26  bits). 


2.  Circuit  Topology/Gate  Design/Inverter  and  Gate  Design  Trade-Offs 


In  order  to  improve  system  power  consumption,  performance,  and  particularly  power 
consumption  during  digital  processing,  and  make  more  power  available  for  analog  computations, 
the  digital  portion  of  this  SoC  was  redesigned.  This  section  provides  a  brief  overview  of 
subthreshold  characteristics  of  digital  logic. 

2.1  Inverter 

A  body  biasing  scheme,  called  the  dynamic  threshold  metal-oxide  semiconductor  (DTMOS) 
inverter,  is  examined  and  compared  to  a  standard  (non-body  biased)  inverter.  Transmission  gate 
logic  is  compared  to  static  CMOS  gates.  We  compare  delays,  noise  margin,  and  power 
consumption  across  process  variation  and  with  Vdd  scaling.  The  library  is  used  to  develop  flip 
flops  and  full  adders.  A  CIC  fdter  is  completed  and  laid  out. 

2.2  Arbitrary  Logic  Gates 

Subthreshold  operation  exhibits  a  PUN/PDN  imbalance.  Stacking  emphasizes  the  problem  even 
more  as  the  stacking  factor  is  greater  (causes  more  imbalance)  in  subthreshold  than  above 
threshold.  We  avoid  stacking  by  using  pass  gate  or  transmission  gate  logic.  This  also  results  in 
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the  added  benefit  of  smaller  and  simpler  implementation  of  XOR/XNOR,  making  for  a  more 
modular  nature  to  implement  the  eommon  logie  gates. 

Different  1-bit  full  adder  topologies  were  studied  to  make  trade-offs  in  area,  power,  and 
propagation  delay.  Transmission  gate  logie,  also  ealled  eomplementary  pair  logie,  is  similar  to 
pass  gate  logie  in  that  the  propagation  delay  is  improved  in  exehange  for  inereased  dynamie 
power.  While  pass  gate  logie  using  n-type  metal-oxide-semieonduetor  (NMOS)  logie  has 
diffieulty  passing  a  high  signal  (p-type  metal-oxide-semieonduetor  [PMOS]  logie/low), 
transmission  gate  logie  uses  a  eomplementary  pair  to  average  the  two  deviee  performanees  for 
better  noise  margins. 

Generally,  above  threshold,  the  weakness  of  pass  gate  logie  is  a  Vt  loss  at  the  output  when 
passing  a  high  input  for  an  NMOS  or  a  low  input  for  a  PMOS.  By  operating  subthreshold,  the 
voltage  transfer  eurve  is  improved  beeause  the  drain-to-souree  voltage  requirement  to  keep  the 
deviee  in  saturation  is  redueed  to  ~3  kT/q  or  ~78  mV  at  (273  K/room  temperature).  Thus,  the 
penalties  in  noise  margin  of  transmission  gate  logie  versus  statie  CMOS  are  redueed. 

Transmission  gate  also  uses  a  eomplementary  NMOS/PMOS  to  average/make  up  for  the  loss  in 
drive  strength. 

Pass  gate  implementations  show  improved  propagation  delay  in  exehange  for  inereased  dynamie 
power.  Logie  for  a  1-bit  full  adder  ean  vary,  resulting  in  tradeoffs  in  area,  power,  and 
propagation  delay.  Pass  gate  implementations  of  the  1-bit  full  adder  show  improved  propagation 
delay  and  area  at  the  eost  of  inereased  dynamie  power.  Transmission  gate  logie  with  buffered 
outputs  use  more  area  than  pass  gates,  but  eliminate  the  undesirable  threshold  voltage  effeets. 
When  using  XOR/XNOR  and  multiplexors,  transmission  gate  is  easily  implemented  due  to  its 
modular  nature.  Pass  gate  logie  implementation  of  transmission  gate  logie  is  very  eonvenient  for 
logie  bloeks  like  a  1  -bit  full  adder. 

In  a  subthreshold  mode  of  operation,  the  weak  inversion  eurrent  has  exponential  dependeneies  on 
Vgs  and  Vt.  Consequenees  inelude  sensitivity  to  supply,  proeess,  and  temperature  variation. 
Body  biasing  is  one  solution  to  this  problem.  Body  biasing  sehemes  are  explored  as  options  to 
reduee  effeets  due  to  proeess  variation  in  Kim  et  al.  (2003).^  (This  system  explores  the  use  of 
DTMOS  inverters,  where  the  gate  and  body  of  the  metal-oxide-semieonduetor  field-effeet 
transistors  [MOSFETs]  are  eonneeted  together).  DTMOS  inverters  use  body  biasing  to  take 
advantage  of  the  exponential  on  Vj.  The  p-n  junetion  is  “forward  biased”  below  the  turn-on 
voltage,  and  foreing  the  resulting  inerease  in  inversion  eharge  leads  to  greater  gate  eapaeitanee, 
but  also  significantly  higher  current  drive  in  DTMOS  inverters. 

One  consequence  of  subthreshold  operation  is  the  reduction  of  Iqn/Ioff,  namely,  in  the  case  of 
stacked  transistors.  When  transistors  are  in  series,  their  overall  strength  is  lower  than  that  of  a 
single  transistor.  Transmission  gate  logic  is  one  solution.  Instead  of  having  different  strengths  in 
the  pull  up  and  pull  down  network,  the  circuit  has  to  face  issues  of  leakage  and  nodes  conflict. 
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3.  Simulation  Results 


Spice  simulations  were  done  on  the  proposed  topologies.  A  DTMOS  inverter  was  compared  to  a 
CMOS  inverter.  The  CMOS  inverter  was  sized  to  have  the  same  rise  and  fall  time  as  the 
DTMOS  when  loaded  with  the  same  capacitance.  The  DTMOS  used  less  active  area,  but  overall 
consumed  a  much  larger  area.  The  results  were  as  follows: 

•  DTMOS:  810  nm  x  120  nm  +  210  nm  x  120  nm  =  122400  nm^ 

•  Static:  1 .48  pm  x  120  nm  +  480  nm  x  120  nm  =  235200  nm^ 

XOR  gates  were  constructed  using  transmission  gates  with  outputs  buffered  by  inverters.  The 
voltage  transfer  characteristic  (VTC)  as  the  output  goes  from  low  to  high  and  from  high  to  low 
are  shown  in  Fig.  2.  The  inverters  reduce  the  variation  at  the  output,  by  decreasing  the  range  of 
input  voltages  where  the  output  is  indeterminate.  There  are  two  distinct  transition  points,  instead 
of  a  symmetrical  curve  due  to  the  different  threshold  values  for  the  PMOS/NMOS  in  the 
transmission  gate. 


Fig.  2  VTC  output  from  low  to  high  and  high  to  low 

A  sine  wave  signal  reference  signal  with  Vpk-pk  of  200  mV  was  processed  by  a  sigma-delta 
modulator  and  the  resulting  signal  input  to  the  26-bit  CIC  filter  to  reconstruct  the  signal.  Fig.  3  is 
a  graph  of  the  transient  power  consumption,  and  the  total  energy  consumption  of  the  CIC  filter. 
The  average  power  for  the  system  is  only  67.34  nW. 
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Fig.  3  Graph  of  the  transient  power  consumption,  and  the  total 
energy  consumption  of  the  CIC  filter 


4.  Conclusions 


The  CIC  filter  demonstrates  lower  power  consumption  than  other  systems  before  it.  However, 
while  the  active  area  consumed  is  lower  than  static  CMOS  equivalents,  the  triple  well  process  on 
a  bulk  CMOS  process  significantly  increases  the  amount  of  wasted  area.  In  a  fully  depleted  SOI, 
this  method  would  eliminate  the  need  for  triple  wells,  as  well  as  improve  the  performance  of  the 
transistors.  Therefore,  future  work  should  be  aimed  at  developing  this  digital  logic  library  at  the 
smaller  transistor  nodes. 
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List  of  Symbols,  Abbreviations,  and  Acronyms 


ADC 

analog-to-digital  converter 

AFE 

analog  front-end 

BCIT 

brain-eomputer  interactive  teehnologies 

CIC 

easeaded  integrator-eomb 

CMOS 

eomplementary  metal-oxide  semieonduetor 

DEMOS 

dynamie  threshold  metal-oxide  semieonduetor 

ECO 

eleetroeardiography 

EEC 

eleetroeneephalography 

EMG 

eleetromyography 

EY15 

fiseal  year  2015 

IC 

integrated  eireuit 

MOSEETs 

metal-oxide-semieonduetor  field-effeet  transistors 

NMOS 

n-type  metal-oxide-semieonduetor 

PMOS 

p-type  metal-oxide-semieonduetor 

SAR 

speeific  absorption  rate 

SNR 

signal-to-noise  ratio 

SoC 

system-on-ehip 

VTC 

voltage  transfer  charaeteristie 
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